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Abstract 

Amyloid fibers are aggregates of proteins. They are built out of a peptide called /3-aniyloid 
(A/3) containing between 41 and 43 residues, produced by the action of an enzyme which cleaves a 
much larger protein known as the Amyloid Precursor Protein (APP). X-ray diffraction experiments 
O^l have shown that these fibrils are rich in /3-structures, whereas the shape of the peptide displays 

an a~helix structure within the APP in its biologically active conformation. A realistic model 

> ■ 

Qs ■ of fibril formation is developed based on the seventeen residues A/312-28 amyloid peptide, which 

(N : 

f^ . has been shown to form fibrils structurally similar to those of the whole A/3 peptide. With the 

<^ \ help of physical arguments and in keeping with experimental findings, the A/312~28 monomer is 

^^ ■ assumed to be in four possible states (i.e., native helix conformation, /3-hairpin, globular low-energy 
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qh| and octamers) were constructed. With the help of short, detailed Molecular Dynamics (MD) 
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state and unfolded state). Making use of these monomeric states, oligomers (dimers, tertramers 



calculations of the three monomers and of a variety of oligomers, energies for these structures were 
obtained. Making use of these results within the framework of a simple yet realistic model to 
describe the entropic terms associated with the variety of amyloid conformations, a phase diagram 
can be calculated of the whole many-body system, leading to a thermodynamical picture in overall 
agreement with the experimental findings. In particular, the existence of micellar metastable states 
seem to be a key issue to determine the thermodynamical properties of the system. 



Amyloid fibers are aggregates of proteins or of fragments of proteins displaying rod-like 
shape. The mechanism which trigger proteins to leave their biologically active conformation 
and aggregate is, in general, unknown. 

The formation of amyloid fibres^ from monomeric proteins moved to the centre of the 
scientific stage when it was found to be associated with Alzheimer's disease (AD) and en- 
cephalopathies such as bovine spongiform encephalopathy (BSE, "mad cow" disease) in 
animals and Creutzfeld-Jakob disease (CJD) in humans. The brain of people with the 
memory disorder are studded with abnormal structures called plaques, which are made out 
of amyloid fibers. In this case, the aggregate is built out of a peptide containing from 41 to 
43 residues, called /5-amyloid (A/3). This peptide is produced by the action of an enzyme, 
the /3-secretase which cleaves a much larger protein known as the amyloid precursor protein 
(APP). This process takes place in everyone, not just people with AD. But people with 
AD have an increased A/? production. It has been suggested that this excess /3-amyloid 
production may lead to fibril formation because, having higher concentration of protein will 
increase the likelihood that any partially folded intermediates will be able to attach to each 
other and aggregate. 

Recent studies have suggested that oligomeric fibrillation intermediates (protofibrils) 
rather than fibrils themselves are pathogenic, but the mechanism by which they cause neu- 
ronal death remains a mystery. The possibility that a molecular species other than the 
amyloid fibril could be pathogenic is testified by the fact that oligomeric species rich in 
/3-sheet structure (protofibrils) have been found to be discrete intermediates in the fibril- 
lization of /5-amyloid (A/5) in vitra^. Furthermore, it has been shown thaf^ mutant amyloid 
proteins associated with familial Alzheimer's and Parkinson's diseases form morphologically 
indistinguishable annular protofibrils that resemble a class of pore-forming bacterial toxins, 
that is, toxins which puncture host cell membranes, suggesting that inappropriate membrane 
permeabilization might be the cause of cell dysfunction and even cell death in amyloid dis- 
eases. Observations which suggest that that an intermediate protofibril might be pathogenic 
and be "detoxified" by conversion to a fibril, is furthermore suggested by the observed lack 
of correlation between the quantity of fibrillar deposits at autopsy and the clinical severity 
of Alzheimer's or Parkinson's diseases; the fact that transgenic mouse models of these con- 
ditions have disease-like phenotypes before fibrillar deposits can be detected; the fact that 
non-fibrillar Aj3 oligomers are toxic in cell culture^'^ and have activity in vivo^. 



The analysis of the molecular structure of the fibrils, specially of protofibrils, could be 
useful to understand the aggregation mechanism. Unfortunately, the insoluble and massive 
character of these systems rules out the possibility of investigating its molecular structure 
with conventional experimental techniques. X-ray diffraction experiments have shown that 
these fibrils are rich in /5-structures, and that these structure are perpendicular to the axis 
of the fibrilS,, whereas the shape of the same peptide in its biological active conformation is 
an a-helij*^. Through electron scanning and atomic force microscopy it has been possible 
to measure the diameter of the fibrils. The results range from 25A^° to 43^^^. Filaments of 
90A in diameter have also been observed. The are built by the assembly of single fibrilsi^. 

Given this state of affairs, it could prove useful to study the structure of protofibrils 
and eventually of fibril filaments and the mechanism of formation by molecular dynamics 
(MD) simulations. However, classical simulations which describe explicitly all atoms of the 
protein and of the solvent can be carried out over periods of time of only few hundreds of 
nanoseconds, while the internal motion of a protein and the displacement of its centre of 
mass takes from microseconds to milliseconds. On the other hand, simplified models which 
neglect important details of the amino acids and of the solvent are not helpful to study the 
problem at hand, which strongly depends on the chemical details of the molecular species 
involved in the process. 

The approach which we have preferred to follow consists in performing short full-atom 
molecular dynamics simulations of the conformational evolution in solvent of the peptides, 
either isolated or interacting with few other identical peptides, for afterwards use the numer- 
ical data obtained to build a thermodynamic model for the formation of larger aggregates. 

I. THE STARTING POINT: RESULTS FROM MD SIMULATIONS 

Due to the large computational cost of simulating the dynamics of the whole 
A/31-42 amyloid peptide, use was made of a fragment of 17 residues, the A/512- 
28 amyloid peptide, which has been shown to form fibrils structurally similar to 
those of the whole peptide. The amminoacidic sequence of this fragment is: 



VI 2 HHQK LVFFA ED V GSNK28 



where the most hydrophobic residues have been boxed and the first and last amino acids 



numbered according to their position in the full A/31-42 peptide. 

In order to find the thermodynamically most stable conformations, we performed 100 ns 
simulations at 295K and 320K, starting from different conformations of the peptide. Molec- 
ular dynamics was implemented through Gromacs software^^, with explicit representation of 
all the atoms of the peptide and of the solvent. The equations of motion are solved making 
use of a leap-frog algorithm in a periodic cell, long-range interaction are treated in the 
particle mesh Ewald scheme, while temperature and pressure are kept fixed through a weak 
coupling with an external bath. 

Simulations starting from an a-helical conformation (cf. Fig. ^ IH state), which is the 
crystallografic native conformation of the peptide when embededded in the whole protein, 
show that the helical features are lost in the first few nanoseconds at both temperatures 
analyzed. These and other simulations, carried out starting from elongated conformations, 
lead to the stabilization first of a transient compact state (Fig. ^ IL state) and then of a 
hairpin-like state (Fig. ^ IB state), both characterized by a loop in the region involving 
residues A21-E22-D23-V24. 

The hairpin-like state is stabilized by the hydrogen bonds, salt bridges and hydrophobic 
interaction among amino acids in the central region of the peptide (from K16 to D23). 
This concentration of strongly interacting amino acids makes one suspect that this region is 
important for the folding of the peptide to its native (helical) state when embedded in the 
APP protein, perhaps belonging to some local elementary structure (LES)^^*^^*^^^ which, 
assemblying to the other LES into the folding nucleus, is responsible for the correct and fast 
folding of the whole protein. This fact would make the effect of the state IB of the cleaved 
A-beta peptides even more dramatic than what expected. In fact, not only these peptides 
aggregate together into amyloid fibrils (as we shall see in the following), but they are likely 
to interfere with the folding process of entire APP proteins, according to the mechanism 
discussed in ref.-^. 

Note that the simulations performed in the present work, although computationally de- 
manding (each of the 100 ns run took 6 weeks on a cluster of pentium nodes), are too short 
to explore the full conformational space of a peptide made out of 17 amino acids. On the 
other hand, the simulations gave strong indication about the stability of the IB state and 
the unstability of the IH state, indications which are clearly supported by the physical un- 
derstanding of the factors which stabilize such states (cf. Section Ila) and by the comparison 



of the predictions of the present model with the available experiments (cf. Section IV). Of 
course these simulations cannot rule out the presence of other thermodynamically important 
states (nor can any MD simulation, nonetheless how long, do). However, these other states 
would have to be rich in /3-structures, as experimentally known^° (cf. also Sect. IV), and 
neither physical intuition suggests /^-conformations other than those discussed above to be 
stable, nor it is likely that such conformation would not have emerged from the simulations 
(the number of different /5-conformations which could be built with a peptide of 17 amino 
acids being quite small). 

Taking advantage of the knowledge of thermodynamically relevant monomeric states, we 
performed 30 ns MD simulations of oligomers (dimers, tetramers and octamers), starting 
from different guesses of oligomeric structures that the monomeric states discussed above 
can assume. The most stable conformations found from the analysis of the trajectories are 
displayed in Fig. El It is worth to emphasise that the MD simulations are not meant to 
explore the full conformational space of such oligomers, but only to search locally around 
the neighbourhood of the guessed oligomeric conformation. Consequently, the results are 
strongly dependent on the initial conformations. Such initial conditions are obtained combin- 
ing the monomeric states discussed above in oligomeric conformations which are physically 
meaningful in order to optimize the interaction between monomers (e.g., state nC opti- 
mizes hydrogen bonds). The main approximation done is, therefore, to assume that optimal 
oligomeric conformations are spatial arrangements of optimal monomeric conformations, ne- 
glecting the fact that the stabilization of monomers into an oligomeric state could strongly 
change the features of the building blocks. The comparison of the results of the model with 
the experimental findings, and especially with the structural information known for fibrils 
and micelles suggests that the approximation is not unreasonable. 

II. THERMODYNAMICAL MODEL: INFINITE DILUTION APPROXIMATION 

We first analyze the thermodynamics of monomeric states, which can be viewed as the 
states relevant in the approximation in which the system is infinitely diluted, so that the 
(infinite) translational entropy prevent monomers from binding together into aggregates. 



A. Free energy of monomeric states 

The interest in the a-hehcal state (Fig. ^ "IH") rehes on the fact that, although this 
conformation, according to our simulations, is not stable, it is the crystallographic struc- 
ture for this sequence when forming part of the A/31-42 wild-type protein under biological 
conditions. 

The internal energy is calculated by averaging the potential energy (i.e.. Coulomb and 
Van der Waals contribution) in a 2 ns time interval at 295K, short enough for the fragment 
to remain in the elical conformation. We obtaini^ Eih = —887. 

In the same MD-short trajectory, the entropy of the sidechain is also calculated. Since 
the motion of the sidechain takes place on a time scale which is much shorter than that 
of the simulation, and since the backbone is essentially fixed during this time interval, it is 
possible to assume that the sidechain is at equilibrium. Thus, one can obtain the free energy 
from the logarithm of the partition function, 

F'"^^^^"''' = -T log < exp(— ) > . (1) 

The brackets <> indicate the thermodynamical average, which in this case gives rise to the 
partition function, and is set equal to the time average by virtue of the ergodic theorem. 
Subtracting from the free energy the average interaction energy, one obtains the entropy 
associated with the motion of the sidechain 

{TS'''^^)iH = -88.6. (2) 

Since the state IH of A/312-28 corresponds to a unique conformation of the backbone 
(the calculated RMSD being lower than 2 A), separated from the other conformations by 
a sharp transitions (for details see ref.— ), the entropy associated with the backbone is set 
equal to zero. 

The solvation free energy AGsoiv, i-e. the difference in free energyi^ associated with the 
interaction between the solvent and the A/312-28 peptide in an elongated (random) and in 
the IH conformation is the result of two effects: a decrease of the free energy due to the fact 
that the hydrophobic side chains become partially hidden to the solvent and an increase of 
the free energy due to the fact that the new compact IH state some hydrogen bonds between 
hydrophobic residues and water present in the unfolded conformation are not allowed in the 



folded conformation. The quantity AGsoiv for the IH state cannot be obtained directly 
from the simulations because, within the short time over which the MD simulations are 
carried out, the motion of the solvent molecules cannot be approximated as at equilibrium. 
Instead, we obtain the numerical value of AGsoiv from the FOLD-X^° approximated force 
field, which gives to each residue in contact with the solvent a contribution to the free energy 
proportional to the exposed surface to the solvent and the degree of hydrophobicity of the 
amino acid. One obtains 

(AG"°'")iH = 62.3 (3) 

The total free energy for the state IH at 295K is obtained from the sum of all the 
contributions discussed above and gives 

F,H = -736.1. (4) 

We choose the state IH as reference for the following calculations, and accordingly we set 
AFiH = 0. Note that the choice of the free energy of the reference state, provided that it 
is the same for all states, is purely conventional, and has no consequence on the following 
calculation. 

The free energy of the hairpin-like state IB calculated in a similar way, giving Eib = 
-881.1, TSf^^ = -100.0 and {AG)ib = 45.8. Unlike the previous case, the state IB displays 
a non-zero entropy associated with the backbone. This is because the harpin defines a 
flexible plane which can bend perpendicularly to the direction of the hydrogen bonds (cf. 
Fig. 121). Since a direct calculation of this contribution directly from MD simulations is out 
of reach, it was estimated by means of a simple model. If each pair of residues connected 
by hydrogen bonds gives rise to r] conformations, the total number of conformations that an 
hairpin built out of A^ residues can assume is r/(^~^)/2. Consequently 

S<^'^^-i^ = (N-l)/2-\nT]. (5) 

The value of r] is not precisely known, but it can be estimated in keeping with the fact that 
in an unconstrained polypetide each residues can assume ~ 9 conformations, and that, since 
the residues in an hairpin are bound by hydrogen bonds, they will display approximately 
half of the degrees of freedom of the unconstrained chain, so that r] = 9^/^ = 3. Within this 
approximation TSf]^^"^ = 21.6 at 295K, leading to Fib = —756.9, and, with respect to the 
reference state IH, AFi^ = Fib - Fih = -20.8. 



Another state which show a sizable stabihty is the state IL. As in the case of the IH 
state, this is associated with a unique backbone conformation, so that 5*^^"*" = 0. From the 
simulations one obtains Eil = -1055.7, TSl^"" = -237.4 and AGff = 66.2, the total free 
energy summing to Fn = —752.1 {AFn = —16.0 with respect to the reference state IH). 

In order to account correctly for the thermodynamics of the system, one has to include 
in the calculations also the set of denaturated states ("lU"). Although not displaying low 
interaction energy, they display a large entropic contribution, and consequently become more 
relevant the higher is the temperature. Also in this case, it is not possible to obtain the free 
energy from MD simulations, which are by far too short to provide an exaustive search in 
conformational space. Instead, we will describe the states of type lU with the help of the 
Random Energy Model^l. Assuming that the interaction between residues in denaturated 
states are random and uncorrelated, and that the chain is long enough that the Central 
Limit Theorem holds, the energy distribution for states of type lU (density of states) is 



g{E) = 7^A;-iexp 



{E - N,eo 



i2 



(6) 



where A^ is the length of the chain, 7 is the number of conformations per residue, k = 
(27rA^cO"^)^ is the normalization constant, Nc being the number of two-body interactions in 
the chain, eo and a are the average and the standard deviation of the two-body interactions. 

In the present case, A^ = 17 and we shall assume that the number of interaction contacts 
Nc is constant for all conformations in the state lU and equal to the average value found in 
the simulations, that is A^c = 16. 

From Eq. (jU)) one finds that the entropy of the chain, defined as In g{E) (aside from an 
overall constant) is given by 



s''-{E) = A^.iny - '\': ' (7) 



. CL 



in the case where E > Ec = Nc,eQ — Nca{2 log 7')-*^/^, with 7' = 7^/^^ U E < E^ the argument 
of the exponential in Eq. (jHl) diverges to — cxd linearly with A^ and there are essentially no 
states with energy below Ec- Making use of the relation dS/dE = 1/T one also finds 

<^>=-^ + Ar,eo. (8) 

In order to calculate the numerical values of eo and a, we have identified the states lU as 
those parts of the trajectories corresponding to conformations structurally differenlj^^ from 



state IB. From the analysis of the trajectorieslS it emerges that these states are separated 
from state lU by strong variations in all thermodynamical quantities, and consequently we 
can interpret lU as a metastable state, separated from the others by a free-energy barrier. 
The associated Boltzmann-like energy probability p{E) = g{E)exp{—E/T)/Z, the entropy 
being that found in Eq. ((7)). This gives 

Finding the concavity of p{E) and the value of Ec from the MD simulation, from Eq. ^ 
and from the definition of E^ one finds a = 12.2 and^'^ Aeo = eq — Fih/N^. = eo + 46 = 14.8. 
From the analysis of the MD trajectories we have also found that TASl^f'^ = +1.5. The 
solvation free energy has been calculated with FOLD-X as the average of the solvation 
energies of ten conformations picked at random from the simulations. The total free energy 
is found as logarithm of the partition function. This is calculated integrating the free energy 
over all possible energies associated with the state lU, that is from Ec to +oo. This gives 

exp(AF,,) = exp fiVlog7 + A^-^ - ^ - ^ + Ua^ (^ - ;^f 1 x 



T 2a^ 2 \ a^ T 

1 ^ 
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/ AE^+(NaHA.,y--T-^)) \-\ 

At a temperature of 295K this relation gives AFm = —13.4. The results discussed in the 
present Section are summarized in Table lU 



B. Population of monomeric states 

From the knowledge of the free energy of all the thermodynamicall relevant states, it is 
possible to calculate their equilibrium population. The partition function reads 

Z = 1 + exp(-AFiB/T) + exp(-AFiL/T) + exp{-AFiu/T). (11) 

The most populated state at 295K results to be the /3-hairpin (IB), whose relative prob- 
ability of occupation is 

exp(-AFiB/r) , , 

PiB = ^^ ^ ' ' = 0.91, (12) 

while the IH state displays a negligible relative probability. 

A systematic analysis of relative state populations as a function of temperature poses some 
problems. First, an analitic expression of the solvation energy with respect to temperature 



is not available. Moreover, extrapolating at high temperature thermodynamical quantities 

found at room temperature is not straightforward. For instance, as the temperature rises, the 

solvent undergoes a phase transition from liquid to vapour, and the extrapolation becomes 

meaningless. Moreover, the average interaction energy and the side chain entropy are only 

calculated at 295K and 320K. 

In any case, between 273K and 373K the solvation free energy displays variations below 

30%2i and one can attempt an extrapolation. The free energy associated with the state 

lU, calculated from Eq. (fTUI) . is showed in the inset of Fig. |3]and is well approximated by 

AFiu = — 16.5T + 29.0. The relative population of the harpin state IB is then 

^ exp(lM + 4.28) 

P^^ 1 + exp(ifi - 59.5) + exp(iM + 4.28) + exp(^ + 16.5) ' ^ ^ 

and is displayed in Fig. HI 

The fact that p{lB) increases with temperature is consistent with the experimental data 

obtained by circular dichroisn>iS and is not trivial, as, in general, physical systems loose 

symmetry as the temperature is increased. This particular behaviour is consequence of two 

facts: 1) the denaturated state lU has a free energy quite larger than the state IB, so that 

it starts competing with the state IB only at high temperatures, 2) the state IB displays 

a conspicuous entropy term associated with thebackbone degrees of freedom on the plane 

defined by the hairpin (cf. Fig. E)), term which is responsible for the decreasing the free 

energy as temperature is increased. 

III. THERMODYNAMICAL MODEL: AGGREGATION 

A solution of /3-amyloid peptides contains a large number of monomers (i.e., peptides) 
which can interact and give rise to conformations which are more complicated than those 
discussed in the infinite dilution limit. Typically, the concentration of peptides is of the 
order of 100/iM (= 10"^ mol/1), the volume of a cuvette is \^ = 1 ml, so the number of 
monomers is of the order of 10^^. 

A. Free energy of the aggregates 

The free energy of the multimeric states is calculated in a way similar to that of monomers 
(see previous Section), by including the interaction energy between different monomers 

10 



and the entropy associated with the translational and rotational degrees of freedom of the 
monomers. The inter-monomer energy is calculated directly from the MD simulations (in 
the same way as the intra-monomer energy), while the new entropic term is approximated 

as 



S"''-''''''Hn) =ln 



V - {n-l)v' 



in — 1) In 



47r3 



(14) 



Au 
where n is the number of monomers present in the volume V. The first term is associated 

with the translational motion of the centre of mass of each monomer, whose Van der Waals 
volume is f = 2nm^— in a volume V, while the latter term accounts for the rotational 
degrees of freedom. The quantity Au is the product of the amplitudes of the Euler angles 
available to each pair of bound monomers and it takes a value of the order of Auj ^ 8 ■ 10~^ 
(see Appendix). Notice that in the first term of Eq. (jl4|) . the term (n — l)v is not negligible 
as compared to V only at concentrations larger than lOM, concentrations which are non- 
biological (experimental concentrations are of the order of lOO/xM). Consequently, we can 
assume V" > (ri - l)f , and approximate Eq. dHj) as ^™t-t™'^'(n) ^ (jvot-trasi^ ^ ^rot-trasi^ 
where we define a™*"*^'"^' = 9.6 and ^5°*"*^''"' = 38.7. 

We define as reference state for an aggregate of n monomers the set "nH" of conformations 
composed on n non-interacting monomers in helical state (IH), whose free energy is FnH = 
nFiH - T5™*-*™"'. 

The states relevant for aggregations are chosen on the basis of the data resulting from the 
MD simulations discussed in Section I. In particular, the free energies associated with some 
of the states displayed in Fig. |21 are listed in Table |nj The state displaying two interacting 
helices (labelled "2K" to distinguish it from the state "2H" displayin two non-interacting 
helices) has a free energy much higher than the other states. Consequently, we will not 
take it into account, nor the multimeric states built starting from it. We will indicate each 
kind of aggregate with a number, which indicate how many monomers which build out that 
aggregate, and a label, which indicates its shape. For example, I8P2 rneans an aggregate 
built out of 18 monomers arranged as indicated in Fig. |21 

The states nPi, nP2, nP4 and nC (see Fig. ^ are those which, for large n, resemble (proto) 
fibrils. Under the assumption that each layer of the fibril (each section (layer) composed of 
1 monomer in the case of states nPi and nC, of two monomers in the case of nP2 and of 
four monomers in the case of nP4) interacts only with its nearest neighbours and, under the 
approximation that the solvation free energy is additive, the total free energy for a given 

11 



kind of aggregate is linear functions in n. The result for the state of kind nk (where nk can 
be nPi, nP2, nP4 and nC) is 

The quantities e™°"(fc), e*"*(/i;), cr**'^'^(/c), cr'^'*"*"(A;) and p^°'^(fc) are the energy densities listed 
in Table IIIIl obtained from linear extrapolation of the data listed in Tables H] and |n] In 
particular, e^°^{k) is the density of internal energy and e^^^{k) is the density of interaction 
energy between monomers. The translational entropy is that given in Eq. ()14|) . The quantity 
e^Q^{k) takes into account that a single layer does not produce any interaction energy (i.e., 
the interaction energy of the state nPi is proportional to n — 1, not to n). 

Other thermodynamically important states are those which display the hydrophobic head 
of the IB conformation towards the centre of the aggregate, in a way similar to lipidic 
micelles (see Fig. 12)). They are less ordered than fibrils, but gain free energy by hiding the 
hydrophobic residues from the solvent. 

The density of internal energy per monomer is not constant, but increases when going 
from dimeric to octameric micelles (being 14 for 2M, 24 for 4M and 49.6 for 8M). This could 
be due to the increasing difficulty of optimizing the contacts between amino acids in a larger, 
more frustrated system. Since we are interested in extrapolating the internal energy for large 
n, we will use the largest value, setting E^m = 49. 6n. The inter-monomer interaction energy, 
the solvation free energy and the sidechain entropy of the dimeric, tetrameric and octameric 
micelles are displayed in Fig. El While i?*"* is almost linear with n, AG^"'"'" and TS^'^'^^ 
are only approximately linear. This is consistent with the fact that the solvation energy is 
not purely additive-^, and the kink in the sidechain entropy appears to be a consequence 
of the diminished hydrophobic compaction for large n (cf. Figs. Eb and c). For the sake 
of simplicity we approximate the scaling of AG^°''^ and of TS'***'^^ by linear functions. The 
general expression of these quantities, once subtracted the energies of the reference state 
nH, lead to 



■^^ 



E'^j^^j = -315.7n + 532.4, 
AG'^i^ = 30.6 - 18.6ri, 

S^t = 1.9n - 10.9 . (16) 

As the size of the micelles increases, an empty volume is created in their interior. The free 
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energy AG™'' needed to create this cavity is, for cavities not too large with respect to the 
size of the solvent molecules, purely entropic and can be related to the probability that 
density fluctuations produce an empty volume of the same size in bulk solvent. Computer 
simulations^^ have shown that the probability distribution for such fluctuations is exp(— ai?^) 
with a = 58nm~'^ and R the radius of the cavity. Approximating the dependence of the 
surface of the cavity on the number of monomers as i?^ = 6 ■ 10~'^{n — 2) nm^, one obtains 

AG^'^H =2.6T(n-2)2 (17) 

The free energy of the state nM is given by the sum of all the terms discussed sofar, that 
is, 

A rp A 77" I rr'int , A /^solv rji \ nside rji \ richain , rji ntrasl , \ /^cav 

i^^nM = i^i^nm + -C/„Af + ^^ nM ~ ^ ^"^nM ~ ^ ^"^nM + -L b + iXLr 

= 49.6n - 315.772 + 532.4 + 30.6 - lS.6n - 1.9Tn + 

+ 10.9T - S.STn + 48.3T + 9.6T{n - 1) + 2.6T{n - 2f = 

= 2.6T{n - 2f + (-1.1T - 284.7)n + 563.0 + 49.6T. (18) 

Other potentially interesting states are nB, composed of non-interacting monomers in 
the state IB, and nlJ, a disorder clamp of n monomers. The free energy of the state nB is 
just n times the free energy of the state IB, that is 

AF^B = -21.1n, (19) 

while that of state nlJ can be found from the random energy model, using again Eq. ()10|). 
substituting N^ by the quantity nN^. To be noted that, according to the present calculations, 
the state analyzed in ref.— gives rise to an oligomer whose free energy is —14.5, much higher 
than the free energies associated to the states displayed in Fig. |21 and consequently will be 
neglected in the following. 

The summary of the free energies associated with the different states and the different 
sizes is displayed in Fig. IHlfor the case of T = 295K. The state nP4 with large n displays the 
lowest free energy minimum, while the state nM appears as a metastable state for small n. 

In a solution of a number N of monomers, the state of each monomer can be described 
as a point in a two-dimensional system. The first coordinate of a point, k, indicates the 
kind of structure in which the monomer is embedded (i.e., micelle (M), disordered clump 
(U), etc.), while the second coordinate Ik indicates the size of that structure (i.e., a micelle 

13 



composed of 2, 3, ... monomers, etc.). The state of the whole system is fully determined by 
the coordinates {k,lk} of each monomer. 

All the calculations performed in the present Section can be summarized in the free energy 
density ({kji^), which is the free energy associated with a single monomer embedded in a 
state of kind k composed, overall, of /^ monomers. This quantity reads 

C{kM) = \\ /' ,, (20) 

where ft = e'^°^{k)+e'''\k)-Ta'^'^^{k)-Ta''''"'^'^{k)+p'"^''{k)W'''^ contains the contribution 
proportional to the size of the aggregate, Sk = eg"* ~ Ta^^"'^'' contains the contribution 
proportional to the number of different aggregates and Hm = 2.6T is the factor associated 
to the quadratic contribution of the micelle (state "M"). 

B. Population of aggregates 

From the knowledge of the free energies associated with the different states of aggregation, 
it is possible to study how a number N of monomers occupy the different states. In doing 
this, we will neglect the interaction between different aggregates. The study of this kind of 
interaction is certainly an interesting problem, but, involving the diffusion of large objects, 
takes place on a time scale which is longer than that of monomers and oligomers. 

In order to use the expression we found for the free energies to obtain information about 
the thermodynamics of a solution of peptides monomers, we employ the occupation-number 
formalism which is often used to describe quantum many-body systems. In other words, the 
state of the whole system is characterized by the set of numbers {n{k,lk)} which indicate 
how many of the N monomers are in the state k of size Ik- The total free energy of the 
system is then 

+ 00 

F{{n{k, Ik)}) = E E ^(k, lk)ak, h), (21) 

k Ik 

where the sum is performed over all possible states of aggregation and all possible sizes of 
the aggregates, and ({k, Ik) is the associate free energy density, as found in Eq. EOl 

It is convenient to calculate the partition fuction in the grand canonical ensemble, since 
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a constrain in the number of monomers is difficult to handle analitically, which gives 

1 



Z 



E 



exp 



-/5EE^(^.^fc)(C(^>4)-/w) 



k li. 



(22) 



where (3 is the inverse temperature and /i is the chemical potential, responsible for setting 
the average number of monomers in the system. The factor before the exponential takes 
into account the fact that monomers are indistinguishable, and consequently the swap of 
two of them does not produce a new state of the system. The partition function can be seen 
as a geometric series, which can be summed up giving 



Z =Y[exp [exp {-(3{Cik, h) - 1^))] ■ 



(23) 



li.lh 



The grancanonical potential can be found as —[3 ^ log Z and, substituting the expression 
of C,{k, Ik) given in Eq. 1201 gives 



0(A/i) 



-/3-1 ^ exp 



I'k 



- /? ^E^^p 



V l-M I'M 



/^ 



(24) 



One can then obtain the average number of monomers occupying a given state k from 



-/3(A-m) 



< rik >-- 



dfk 



exp 



1/2 



if A;^M 
if fc = M, 



(25) 



-P ( /m - 4/iM - /^ + 2hM (4 + f^j' 

where one has summed the second expression by saddle point technique. Performing the 
higher derivatives, one could obtain all the moments of the distribution. In order to obtain 
the value of the chemical potential /i for a given total number of monomers A^ (which one 
usually knows, having prepared the solution at a given concentration), we set J2k < ^k >= 
N, obtaining 



/i = /?-'log- 



(26) 



g-,3(/M-4hM+2hM(4+SM//»M)i/2 ^ j-kiNe-i^'^/^ + pSkEi{-f3sk/N)) ' 

being Ei the exponential integral Ei{x) = — J^^dt e^^/t. 

It is now possible to describe the thermodynamics of a solution of A^ interacting 
monomers. Using Eq. ()25|1 together with the chemical potential found in Eq. (J2EI), it 
is possible to define a probability for a monomer to be in state k as the average number of 
monomers in the state k over the total number of monomers, namely p{k) =< n^ > /N. 
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The chemical potential /i as a function of the total number of particles of the solution is 
shown in Fig. UH^), for temperature T = 2.5 (equal to 295K) and T = 3.5 (equal to 420K). 
The condition of local stability requires that d^/dN > 0, condition which is satisfied, 
for both temperatures, for A^ ^ 100 or for N —>■ oo. Under the condition N ^ 100 the 
equilibrium state is the micelle (see Fig. U![h) ), while above that number of monomers the 
state nP4 becomes overwhelming, the other states palying essentially no role. As it is clear 
from Figs. Wic) and (d), the population of the states nM and nP4 depends weakly on the 
temperature. 

It is worth to highlight the fact that, although the free energy of micelles is approximately 
linear with their length, it displays a nontrivial behaviour like the crossover shown in Fig. 
7(b) and (c). The reason for this behaviour is associated with the fact that the probability 
of the state nP4 depends not only on the associated free energy AFnP4, but also on the 
partition function, and consequently on the free energy of all other states, including that 
of micelles, which is nonlinear in the number of monomers which builds out the structure. 
This fact emphasises the important role micelles have on the behaviour of the system. 

IV. DISCUSSION 

The unusual features of the peptide A/?-amyloid 12-28, which encodes for an alpha-helical 
structure when embedded in the whole protein, while it folds to a well-defined beta-hairpin 
when cleaved, are related to the specific chemistry of the peptide and are discussed in detail 
elsewherei^. The thermodynamics of the isolated peptide is non trivial, in that, in the range 
of biological temperatures, it gets more structured, the higher is the temperature. This is 
clear from Fig. EJ which displays the fractional population of the /5-hairpin state IB growing 
as function of the temperature up to ~ 315K. This behaviour is due to the fact that the 
stability of the state IB, unlike the stability of its competitors (IH and IL), relies heavily 
on its chain entropy. The higher the temperature is, the more important the entropic term 
becomes. This behaviour has been measured through circular dichroisn>iS. In this reference 
the authors report percentages of /9-structures of 30% at 278K, of 50% at 308K and 70% at 
328K. 

At temperatures higher than 315K the unfolded state lU, which is also entropically sta- 
bilized, gets importance, and consequently the population of the state IB starts decreasing. 
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At high temperatures the state lU becomes dominant and the peptides gets completely un- 
structured. Note that, anyway, the predictions of the present model cannot go further than 
the range of biological temperatures, because the solvent phase transitions which take place 
at higher temperatures make any extrapolation of the free energy meaningless. 

The present model also indicates that a single infinite-long fibril of kind nP4 is the true 
ground state of the system. This is not consistent with atomic force microscopy experiments, 
which describe a population of fibrils of heterogeneous length^S.. The reason for this result 
is quite simple. The stabilization energy of small oligomers is of the order of T, while 
that of fibrils is of the order of 10^ — lO^T, and consequently, once they are formed, they 
are not able to separate. Moreover, it is reasonable to assume that, as fibrils grow, their 
diffusion coefficient decreases. As a consequence, if several incipient fibrils are present at 
a given time, they cannot but grow, competing for the available monomers in solution, 
until they have completely depleted the solution. The result of such a process is a number 
of fibrils of heterogeneous length, which are "frozen" in the sense that they represent an 
out-of-equilibrium state strongly dependent on the initial condition (it is a case of "kinetic 
partitioning"). An important consequence of this fact is that most likely the system will 
never reach its thermodynamical equilibrium, while the experimentally observed states will 
be metastable states, determined by the initial events of the oligomerization process. 

Generalizing this idea, one could speculate that also the inner structure of fibrils is depen- 
dent on the initial stages of the dynamics. In fact, in the literature are described experiments 
where /3-amyloid peptides of different length, but all including the central hydrophobic re- 
gion, aggregate in fibrils^S of different diameters and different section shape. In ref.-^fl pep- 
tide /5A10-35 is prepared at 273K, concentration 0.2 mM and pH 5.6 or 7.4, and fibrils 

of diameter 80 90A are observed. In ref.— the peptide /3A11-25 is prepared at 6 mM 

concentration, while temperature and pH are not specified. They observe fibrils displaying 
an elongated section whose longer axis is 30A. The peptide /5A16-22, prepared at 299K, 
ImM concentration and pH 7, builds fibrils characterized by antiparallel /J-stranda^. 

These three kinds of fibrils observed experimentally are structurally similar to the states 
nP4, nP2 and nC, respectively, states which display a similar stabilization free energy (cf. 
Fig. ini). Consequently, if at the initial stages an incipient fibril of a given type (e.g., nP4, 
nP2 or nC) is favoured, that kind will become dominant even if it does not correspond to 
the global free energy minimum. Within this context, the abundance of states IB among 
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monomeric states at high temperature could explain the uprise of fibrils of kind nC described 
in ref.— , in that these monomers can assemble into nC fibrils without encountering any major 
free energy barrier. The process is then driven by diffusion, and the time needed to start 
fibrils is proportional to the square concentration of free monomers. At lower temperatures, 
as in the case of the experiment described in ref.—, the monomeric peptides that could be 
in solution are not in the IB state, and consequently the fibrillation mechanism could follow 
another pathway, involving more complex elementary units, namely micelles, and leading to 
fibrils with a larger section. 

If fibrillogenesis is strongly dependent on the initial conditions, then micelles will play 
an important role. In fact, micelles are local equilibrium states which are stable when the 
number of monomers in the system is low. At the initial stages of the dynamics (after t 
seconds), the effective volume that each monomer can span is Vefj = {Dt)^^"^, where D is 
the diffusion coefficient, of the order of 10"^*^ m? / s^^. Consequently, the effective number of 
monomers which is relevant for the out-of-equilibrium thermodynamics is A^e// = c^DtY'"^, 
where c is the concentration. For a concentration c = 100 /xM one obtains N^ff = 100, 
compatible with the stability of micelles (cf. Fig. Ej), one obtains t = 10~^s. This means 
that if incipient fibrils are not created in the first millisecond from the direct assembly of 
monomers, the free monomers and oligomers will aggregate into micelles, whose size is of 
the order of Im = (4 + SM/hM^^^ ~ 15 (cf. Eq. ((2S))). This is consistent with the results 
obtained by Benedek and coworkers by small angle neutron scattering experiments^"^ and 
with the model they propose, although for the whole peptide A/51-40. Once micelles are 
formed, they are remarkably stable (see Fig. ^. 

The important role of micelles is to deplete free monomers and small oligomers from 
the solution, which otherwise could build thin fibrils (of kind nC and nPi). Fibrils are 
then cretated not directly from monomers, but as transformation of micelles, as suggested 
in ref.-^^*^. The free energy barrier to go from a micelle to a fibril (e.g., of kind nP4) 
describes the internal rearrangement of monomers and cannot be derived by the present 
model. Anyway, it suggests that the number of monomers in fibril seeds, that is the shortest 
stable fibril, is of the order of Im ~ 15. 

Micelles are then a finite-size effect which plays a fundamental role in the fibrillogenesis 
mechanism. This turns out not to be a real nucleation process. In fact, we suggest that 
fibrils arise either by spontaneous assembly of monomers without any major free energy 
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barrier, or by transformation of a micelle-like metastable state into a fibril of equal length, 
depending on the conditions of the solution. 

V. CONCLUSIONS 

We make use of MD simulations to analyze the mechanism which makes A/9-fragments 
aggregate into fibrils. Since a brute-force MD approach is neither computationally feasible, 
nor leads to a real insight into the aggregation thermodynamics, we use MD trajectories as a 
starting point in order to detect those states which are thermodynamically important and to 
characterize their free energy. Making use of these informations, we build a simple physical 
model which teaches us some important facts about the A/9-fragments: 1) in the range of 
biological temperatures, the peptides is more stabilized into a j3 structure, the higher is 
the temperature; 2) micelles are an important metastable state which is populated at the 
beginning of the dynamical process and subtract monomers and small oligomers from the 
solution; 3) fibrils arise either by direct assembly of the few free monomers (at relatively 
high temperature) or by transformation of a micelle into a fibril of the same size. 

APPENDIX A: ESTIMATE FOR THE ROTATIONAL ENTROPY 

If one of the monomers is selected to set the reference axes, each of the A^^ — 1 other 
monomers, when not bound, can rotate freely in space, the product the amplitudes of its 
three Euler angles being An^. When bound to another monomer, the three angles are 
constrained. To evaluate to which extent they are constrained, we calculate the average 
vibration length of the bond distances between two monomers and obtain the angular am- 
plitude by trigonometric transformations. For example, the length of the longer axis of the 
IB conformation is ~ 20A and the average vibration length of the intra-monomer bonds 
is 2A. Consequently, the angular amplitude associated to the vibration of the two bound 
monomers along their longer axis is tan~^(2/20) = 0.1. Similarly, one finds for the other two 
angular amplitudes the values 0.4 and 0.2 (assuming that the width of the IB conformation 
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is ~ 5A). The product of the three angular amphtudes give Auj = 8-10 
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FIG. 1: The conformation of some relevant monomeric states. 

FIG. 2: The conformation of some relevant oligomeric states. Each state is built out of a number n 
of monomers, each of them being in the state IB. States nPi, nP2, nP4 and nC display translational 
symmetry in the direction perpendicular to the plane defined by the hairpin, while state nM displays 
discrete rotational symmetry. 

FIG. 3: A sketch of the backbone degrees of freedom associated with the state IB. 

FIG. 4: The relative population of state IB as a function of temperature. In the inset, the 
dependence of the free energy of the state lU as a function of T. 

FIG. 5: The internal energy (a), the total solvatation free energy (b) and the total energy associated 
with the sidechain entropy (c) of oligomeric micelles, as a function of the number of monomers. 
These quantities are found from MD simulations at 295 K. 

FIG. 6: The free energies associated with the different aggregation states at T = 295K, displayed 
as a function of the size of the aggregate. 

FIG. 7: (a) The chemical potential as function of the total number N of particles at temperatures 
T=2.5 kJ/mol (= 295K) and T=3.5 kJ/mol (= 420K). (b) The fractional population of states nP4 
and nM at T=2.5 kJ/mol as a function of N. (c) and (d) The fractional population of states nP4 
and nM as a function of temperature, for N = 100 and N = 10000, respectively. 
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State 


AE 


2-^5side 


^Qsolv 


T^ A achain 


AF 


IH 

















IB 


5.9 


-11.4 


-16.4 


21.6 


-20.8 


IL 


-168.7 


-148.8 


3.9 





-16.0 


lU 


48.4 


2.0 


-19.4 


39.3 


-12.3 



TABLE I: The energy terms associated with monomeric states. 



State 


AE 


T?int 


rp aside 


rn achain 


^QSOI 


AF' 


2Pi 


-62.0 


-135.9 


-8.8 





-9.0 


-198.1 


2P2 


-37.6 


-62.7 


-0.8 


44.2 


-24.8 


-168.5 


2C 


-23.6 


-152.2 


+9.4 





-22.6 


-207.8 


2K 


+58 


-124.1 


-32.8 





-15.6 


-49.2 


2M 


+14 


-140.0 


-16.1 


44.2 


-30.5 


-184.9 


4P4 


+7.2 


-153.8 


+12.8 





-70.2 


-229.6 


4Pi 


-128.0 


-293.6 


+4.8 





-20.7 


-446.5 


4M 


+96.0 


-785.2 


-38.8 


88.4 


-15.9 


-754.7 


8M 


+397.6 


-1975.2 


102.4 


176.8 


-127.0 


-1983.8 


8P4 


-150.4 


-801.6 


-18.4 





-135.4 


-567.4 



TABLE II: The free energies of relevant aggregated states with reference to that state nH. The 
results were obtained from MD simulations at 295 K. The quantity AF' is the free energy of 
the associated state, except for the contribution of the translational entropy and of the entropy 
necessary to create a cavity within the micelles. 



State 


^mon 


Ant 


Ant 
^0 


^side 


^chain 


^solv 


nPi 


-31.0 


-135.9 


+135.9 


-1.8 





-4.5 


nPa 


-18.8 


-128.9* 


+195.2* 


-0.16 





-12.4 


nP4 


-18.8 


-161.9 


+481.0 


-0.16 





-16.9 


nC 


-11.8 


-152.2 


+152.2 


1.9 





-11.3 



TABLE III: The energy parameters which generalize the results listed in Table 2 (cf. Eq. (|2 
(*) is found from the combination of states 2P2 and 4Pi. 
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